A Labelled Corpus for Prepositional Phrase Attachment

نویسندگان

  • Brian Mitchell
  • Robert J. Gaizauskas
چکیده

This paper describes a labelled corpus intended for training learning algorithms to attach prepositional phrases (PPs). Taken from the PTB2, we believe it is the largest available resource for this purpose, especially as it contains many patterns in which PPs occur ambiguously (nearly all previous research has focused on just one pattern) and we present some results for the five most common patterns. Moreover, the corpus contains some features that, to our knowledge, have not been used before for attaching PPs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prepositional Phrase Attachment Ambiguity Resolution Using Semantic Hierarchies

This paper describes a system that resolves prepositional phrase attachment ambiguity in English sentence processing. This attachment problem is ubiquitous in English text, and is widely known as a place where semantics determines syntactic form. The decision is made based on a four-tuple composed of the head verb of the verb phrase, the head noun of the noun phrase, and the preposition and hea...

متن کامل

An Unsupervised Approach to Prepositional Phrase Attachment using Contextually Similar Words

Prepositional phrase attachment is a common source of ambiguity in natural language processing. We present an unsupervised corpus-based approach to prepositional phrase attachment that achieves similar performance to supervised methods. Unlike previous unsupervised approaches in which training data is obtained by heuristic extraction of unambiguous examples from a corpus, we use an iterative pr...

متن کامل

Statistical Models for Unsupervised Prepositional Phrase Attachment

We present several unsupervised statistical models for the prepositional phrase attachment task that approach the accuracy of the best supervised methods for this task. Our unsupervised approach uses a heuristic based on attachment proximity and trains h'om raw text that is annotated with only part-oi;speech tags and morphologicM base forms, as opposed to attachment information. It is therefore...

متن کامل

Prepositional Phrase Attachment and Interlingua

In this paper, we present our work on the classical problem of prepositional phrase attachment. This forms part of an interlingua based machine translation system, in which the semantics of the source language sentences is captured in the form of Universal Networking Language (UNL) expressions. We begin with a thorough linguistic analysis of six common prepositions in English, namely, for, from...

متن کامل

A Rule-Based Approach to Prepositional Phrase Attachment Disambiguation

I:n this paper, we describe a new corpus-based approach to prepositional phrase a t t achment disambiguation, and present results colnparing peffo> mange of this a lgori thm with other corpus-based approaches to this problem.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004